Scene Graph Generation


Scene graph generation is the process of creating structured representations of scenes that capture the relationships between objects.

OR-Action: Multi-Role Video Understanding with Fine-Grained Actions

Add code
Jun 11, 2026
Viaarxiv icon

Plan-and-Verify Video Reward Reasoning with Spatio-Temporal Scene Graph Grounding

Add code
Jun 10, 2026
Viaarxiv icon

AllDayNav: Lifelong Navigation via Real-World Reinforcement Learning

Add code
Jun 09, 2026
Viaarxiv icon

HDSL: A Hierarchical Domain-Specific Language for Structured 3D Indoor Scene Generation and Localized Editing with LLM Agents

Add code
Jun 08, 2026
Viaarxiv icon

Visual Commonsense Driven Knowledge Refinements for Scene Graph Generation

Add code
Jun 04, 2026
Viaarxiv icon

QPredSGG: Hybrid Quantum Predicate Learning for Long-Tailed Scene Graph Generation

Add code
Jun 03, 2026
Viaarxiv icon

HyperVis: Continuous Latent Visual Relational Graphs on the Lorentz Hyperboloid for Compositional Reasoning

Add code
Jun 04, 2026
Viaarxiv icon

Seeing Fast and Slow: Bimodal 3D Scene Graphs for Open-set Tasks

Add code
May 29, 2026
Viaarxiv icon

Narrative Knowledge Weaver: Narrative-Centric Retrieval-Augmented Reasoning for Long-Form Text Understanding

Add code
Jun 04, 2026
Viaarxiv icon

Dive into the Scene: Breaking the Perceptual Bottleneck in Vision-Language Decision Making via Focus Plan Generation

Add code
Jun 02, 2026
Viaarxiv icon